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Abstract 


Considerable  attention  has  been  paid  to  the  development  of  nonparametric  conditions 
on  P[X_j  =  that  characterize  a  =  1  (locally  independent,  monotone,  unidimen¬ 
sional)  latent  variable  representation  for  the  binary  items  2C/  =  •  •  • ,  Holland 

and  Rosenbaum  (Holland  1981;  Rosenbaum  1984;  Holland  and  Rosenbaum  1986)  focus  on 
the  conditional  association  (CA)  of  subtest  scores  under  strict  unidimensionality,  and  Stout 
(1987,  1990)  treats  the  larger  class  of  essentially  unidimensional  (d^  =  1)  models,  focusing 
on  consistent  estimation  of  0  using  proportion  correct  in  long  tests.  In  the  present 
paper  we  investigate  the  intersection  of  these  two  approaches,  using  Stout’s  principle  that 
any  reasonable  set  of  items  X_j  can  be  embedded  in  an  infinitely  long  sequence  of  items  X 
of  the  same  character. 

We  introduce  three  concepts  which  are  helpful  in  the  search  for  such  a  characterization. 
First,  we  consider  only  representations  which  are  minimally  useful  in  the  sense  that  (1)  the 
latent  trait  0  can  be  consistently  estimated  from  the  item  responses;  (2)  0  is  monotonically 
related  to  the  test’s  “true  score;”  and  (3)  0  is  not  constant  in  the  examinee  population. 
Second,  we  argue  that  the  condition  Cov  {Xi,Xj\Xj)  <  0  is  a  natural  one  to  add  to  CA  and 
df;  =  1  to  ensure  that  the  items  au-e  locally  independent  with  respect  to  0.  Third,  we  show 
that  the  monotonicity  of  the  empirical  ICC’s  P[Xj  =  l|Xj  —  Xj/J]  is  intimately  related  to 
ICC  monotonicity:  this  “manifest  monotonicity”  must  hold  if  d^,  =  1  holds;  and  conversely 
it  can  be  used  to  verify  monotonicity  of  the  usual  ICC’s  Pj{0)  when  df:  =  1  holds. 

We  obtain  a  nearly  complete  nonparametric  characterization  of  useful  djr,  =  1  represen¬ 
tations  in  terms  of  CA,  d^;  =  1,  manifest  monotonicity,  and  the  above  negative  covariance 
condition.  The  negative  covariance  condition  may  not  be  strictly  necessary  for  all  d^,  =  1 
representations,  but  we  show  with  a  small  simulation  study  that  it  probably  does  hold  for 
di,  =  1  models  often  considered  in  practice:  the  Rcisch,  2PL  and  3PL  models. 

Key  Words:  strict  unidimensionality,  essential  unidimensionality,  useful  models,  condi¬ 
tional  association,  negative  association,  empirical  item  characteristic  curves,  monotonicity, 
simulation. 
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1  Introduction 


Item  response  theory,  IRT,  is  a  modern  attempt  to  model — and  statistically  analyze — 
examinee  responses  on  standardized  achievement  or  aptitude  tests.  IRT  modeling  and  anal¬ 
ysis,  which  occurs  at  the  level  of  individual  test  questions — items — is  greatly  facilitated  by 

I 

the  assumption  of  unidimensionality,  i.e.  that  the  latent  trait  ‘‘driving’'^  the  item  responses 
is  a  one-dimensional,  typically  real-valued,  random  variable.  Birnbaum  (1968)  and  Lord 
(1980)  provide  complete  accounts  of  traditional  unidimensional  IRT.  In  this  paper  we  are 
concerned  with  a  general  characterization  of  (the  distributions  of)  item  response  data  for 
which  traditional  unidimensional  IRT  representations  exist. 

For  our  purposes,  a  test  is  simply  a  vector  of  J  items,  or  equivalently  J  binary  (0/1)  item 
response  vai'  bles,^’ 

. _ Xj  =  {XuX2,...,Xj), 

representing  the  correctness  of  responses  of  a  randomly-chosen  examinee  to  the  J  test  items. 
Let  Xj  represent  an  arbitrary  fixed  outcome  of  20?  3.  response  pattern;  an  IRT  model 

makes  assumptions  on  the  conditional  distribution  P\X  ?  =  xj  ISl  =  £]  which  impose  restric¬ 
tions  on  the  marginal  distribution  P[X  j  =  through  the  integral 

P[Xj  =  xj]=  j  P[Kj  =  =  0]  dF[0).  (1) 

Here  F{&)  is  the  sampling  distribution  of  the  latent  trait  or  trait  vector  0  =  (0i, ....  0^)  in 
the  examinee  population  under  discussion;  thus  our  point  of  view  is  similar  to  that  of  Cressie 
Holland  (1983). 

VVe  can  estimate  the  marginal  distribution  P\X ,  =  by  examining  the  response  data 
from  an  actual  test  administration.  We  will  call  this  the  manifest  structure  of  the  test. 

since  it  can  be  identified  to  arbitrary  accuracy  by  increasing  the  manifest  data:  in  this  case. 

% 

by  increasing  the  number  of  examinees  observed.  On  the  other  hand  neither  the  rr.arginal 
distribution  F{^  nor  the  conditional  distribution  P[Xj  =  3lj|0  =  0]  is  directly  obsf'rvable  to 
us,  in  the  sense  that  neither  quantity  is  fully  identifiable  without  also  increasing  the  number 
of  itemsL  These  quantities  determine  the  latent  structure  of  the  test. 

’It  is  possible  in  principle  to  decide  whether  the  Rasch  rnodel  holds,  without  more  items,  using  the  special 
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The  representation  in  (1)  does  not  itself  restrict  the  distribution  of  item  responses  P[X  r  = 
in  any  way.  Standard  IRT  practice  involves  the  imposition  of  additional  conditions  that 
maJce  (1)  a  restrictive,  and  hence  meaningful,  representation.  It  then  becomes  a  meaningful 
and  important  question  to  ask  whether  the  model  so  proposed  “fits”  the  observed  response 
data. 

The  traditional  IRT  assumptions  are  that  local  independence  holds, 

^’ii;=sji0=a  =  n/=.FiA-,  =  xii©  =  a 

and  that  monotonicity  holds, 

Pj{^)  =  P[^j  —  1|0  =  ^]  coordinatewise  nondecreasing  in  6_,  V  j  (M) 

in  the  sense  that  if  <  6^^^  for  all  k  —  1,2,  ...,d  then  <  Pj{0^^^).  When  0  is 

one-dimensional,  Pj{0)  will  be  called  an  item  characteristic  curve,  ICC. 

One  additional  assumption  is  needed  to  make  (1)  restrictive,  namely  that  the  dimension¬ 
ality  d  of  ^  is  much  smaller  than  the  test  length  J  (see  for  exjimple  Holland  and  Rosenbaum, 
1986),  that  is; 

d  <  J.  (D) 

(In  the  development  that  follows,  this  is  formalized  by  requiring  that  d  remain  fixed  ais  J 
grows.)  The  three  assumptions,  LI,  M,  and  D  form  the  foundation  of  item/test  modeling  in 
traditional  IRT.  Appendix  A  gives  examples  to  show  that  if  any  of  these  three  assumptions 
is  completely  omitted  the  resulting  “model”  will  fit  any  distribution  of  binary  data  (hence 
making  it  scientifically  meaningless).  The  least  d  for  which  the  representation  (1)  holds  and 
satisfies  LI  and  M  (and  smoothness  of  the  IRF’s)  we  will  denote  d^.  We  will  refer  to  the 
case  in  which  d^,  =  1  as  the  strictly  unidimensional  Ccise. 

Various  special  cases  of  the  strict  unidimensionality  assumptions  have  been  investigated 
to  see  what  properties  they  imply  for  the  manifest  distribution  P[X j  =  Xj]  through  (1). 
Holland  and  Rosenbaum  (Holland,  1981;  Rosenbaum,  1984;  Holland  and  Rosenbaum,  1986) 

relationship  of  this  model  with  log-linear  models  (c.f.  Cressie  and  Holland,  1983;  Tjur,  1982).  However  this 
does  not  appear  to  be  possible  for  other  models,  nor  for  the  general  question  of  unidimensionalily  prior  to 
parametric  model  selection  which  concerns  us  here. 
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have  shown  that  when  the  general  di,  =  \  assumptions  hold,  the  items  Xj  must  be  con¬ 
ditionally  associated-,  this  shows  that  =  1  is  a  restrictive  and  hence  meaningful  set  of 
conditions.  Cressie  and  Holland  (1983)  (se  also  Tjur,  1982),  have  characterized  the  Rasch 
model  in  terms  of  a  suitably  restricted  log-linear  model  for  P[2Lj  =  ^]-  And  de  Finetti’s 
Theorem  in  classical  probability  theory  may  be  used  to  characterize  an  infinite  sequence  of 
items  with  identical  item  response  functions  by  the  property  that  the  items  themselves  must 
be  exchangeable. 

Stout  (1987;  1990)  capitalizes  on  the  good  6-estimation  properties  of  the  proportion  cor¬ 
rect  score  Xj  =  j  Xj,  when  J  is  large  and  d/,  =  1  holds,  to  produce  a  statistical  test  of 
latent-trait  unidimensionality.  Stout’s  statistical  test  is  tailored  to  his  essential  unidimen¬ 
sionality  condition  (dg  =  1)  which,  in  contrast  to  strict  unidimensionality,  allows  there  to 
be  some  minor  dependencies  among  items  as  well  as  nonmonotonicities  of  individual  ICC's. 

Since  we  will  be  considering  latent  variable  representations  that  are  somewhat  more 
general  than  the  traditional  d^,  =  1  representation,  it  is  worthwhile  to  ask  what  constitutes 
a  “useful”  unidimensional  latent  variable  representation.  In  a  nonparametric  setting  we 
propose  that  such  a  representation  should  satisfy  the  following  definition.  Note  that  what 
we  mean  by  “useful”  here  relates  primarily  to  connecting  an  examinee’s  item  responses 
with  an  estimate  of  or  inference  about  his/her  latent  trait  score.  For  other  purposes,  other 
definitions  might  be  appropriate. 

Definition  1.1  An  IRT  representation,  in  which  LI  may  or  may  not  hold,  will  be  called 
useful  if  and  only  if  the  following  principles  are  satisfied: 


U1  0  can  be  estimated  from  the  observed  values  of  .Vi,  Xj, . . .  ,Xj.  At  minimum 
this  should  mean  that  there  are  functions  tj{xi,. . .  ,xj)  that  consistently 
estimate  0  in  the  sense  that 

tj{X^,...,Xj)^Q 

as  the  test  length  J  grows.  Moreover,  consistent  estimation  should  still  be 
possible  even  though  any  fixed  small  group  of  items  items  (Fi, . . . ,  Fjg)  in  X 
is  dropped  from  X- 
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U2  Examinees  with  higher  0  values  tend  to  score  higher  on  the  test.  A  very 
weak  condition  along  these  lines  is  simply  the  requirement  that  the  average 
ICC  Pj{0)  be  increasing  in  8  for  each  J;  in  other  words 

^[AjIO  =  6\  is  increasing  in  0. 

U3  0  is  useful  for  categorizing  examinees.  In  particular  0  should  be  able  to 
take  on  at  the  very  least  two  distinct  values,  each  with  positive  probability. 

These  principles  are  implicit  in  traditional  IRT  work,  and  are  easily  justified  on  practical 
grounds: 

First,  0  hais  little  statistical  value  as  a  index  of  ability,  achievement,  aptitude,  or  other 
latent  trait,  if  it  cannot  be  estimated;  hence  Ul.  There  is  no  hope  that  0  can  be  estimated 
with  high  precision  unless  J  — >  oo  (e.g.  the  survey  by  Fienberg,  1986),  so  Ul  represents,  in 
some  sense,  a  minimal  estimation  condition.  The  principle  that  estimation  of  0  should  not 
depend  strongly  on  which  particular  items  are  used  is  central  to  what  we  mean  by  “latent 
trait.” 

Principle  U2  reflects  the  interpretation  of  0  as  a  quantity  of  the  latent  trait,  and  of 
the  test  as  an  instrument  for  measuring  that  quantity.  It  will  be  seen  below  (Theorem  2.2) 
that  U2  also  makes  it  easier  for  the  representation  to  satisfy  Ul.  When  specific  parametric 
models  are  constructed  (recently,  e.g.,  Jannarone,  1986;  Sympson,  1987),  there  seems  to  be 
considerable  latitude  available  in  violating  U2  and  still  having  a  representation  which  satisfies 
Ul;  but  for  nonparametric  purposes  and  for  the  purpose  of  interpreting  0,  U2  is  appropriate. 
Note  that  U2  does  not  require  individual  ICC’s  to  be  monotone;  this  requirement  is  only 
made  of  the  test  characteristic  curve. 

Principle  U3  simply  reflects  the  practical  desire  to  use  the  test  to  diagnose,  assess  or 
otherwise  categorize  examinees  and  examinee  populations.  If  there  is  no  variation  in  0  then 
there  is  no  sensible  way  to  use  the  test  in  this  way.  In  terms  of  (1),  U3  asserts  that  the 
prior  0  distribution  does  not  concentrate  at  a  single  0  value. 

Except  for  the  special  cases  of  the  Rasch  model  and  de  Finetti’s  theorem,  no  other 
characterizations  of  =  1  in  terms  of  features  of  the  manifest  structure  P[Xj  =  seem 
to  be  known.  A  general  characterization  is  important  for  several  reasons.  First,  it  allows 
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US  to  better  understand  the  structure,  identifiability  and  meaning  of  the  representation  (1) 
under  di  =  For  example,  a  consequence  of  our  work  here  is  a  better  understanding  of 
how  “far”  each  of  the  CA  and  =  1  approaches  are  from  the  general  dx,  =  1  cissumptions. 
Second,  it  suggests  that  fit  tests  of  the  general  dx,  =  1  representation  are  possible,  without 
resorting  to  specific  parametrizations  of  the  ICC’s,  prior  0  distribution,  etc.  Hence  we  could 
distinguish  between  a  lack  of  fit  due  to  latent  trait  multidimensionality  and  a  lack  of  fit  due 
to  a  poor  choice  of  parametrization. 

In  this  paper  we  review  the  nonparametric  approaches  of  Holland/Rosenbaum  and  Stout, 
and  consider  some  new  conditions  which  bring  us  closer  to  a  general  characterization  of  strict 
unidimensionality.  Throughout  this  paper  we  embed  the  finite  test  Xj  in  an  infinite  sequence 
of  similar  items 

X  =  {XuX2,...). 

LI  and  other  traditional  IRT  properties  extend  in  a  natural  way  to  the  infinite  item  sequence 
X  by  requiring  that  they  hold,  in  a  consistent  fashion,  in  every  finite-length  test  Xj  taken 
from  X- 

In  addition  to  the  CA  and  dx;  =  1  assumptions,  we  argue  that  it  is  natural  to  require 
Cov  (Xj,  XjjXj)  <  0  to  ensure  that  the  items  are  locally  independent  with  respect  to  0.  Also, 
we  show  that  the  monotonicity  of  the  empirical  ICC’s  P[Xj  —  IjXy  —  XjfJ]  is  intimately 
related  to  ICC  monotonicity:  this  “manifest  monotonicity”  must  hold  if  dx,  =  1  holds;  and 
conversely  it  can  be  used  to  verify  monotonicity  of  the  usual  ICC’s  Pj{6)  when  d^  =  1  holds. 
(The  result  that  dx,  =  1  implies  manifest  monotonicity  hcis  been  discovered  independently 
by  I.  Molenaar  (priv.  comm.)  and  is  based  on  Grayson’s  (1988)  monotone  likelihood  ratio 
result  for  proportion  correct  scores.) 

Now  consider  representations  in  which  expected  values  of  the  form  £’[/(X_/)|0  =  0] 
are  continuous,  but  LI  may  or  may  not  hold.  Within  this  class  of  “smooth”  latent  trait 
representations,  we  can  characterize  useful  dx,  =  1  as  follows  (a  more  formal  statement  of 
the  result  is  given  in  Section  4). 

Characterization  of  dx,  =  1.  For  any  infinite  sequence  of  binary  items  X  and  latent  trait 
0,  a  useful  dx,  =  1  representation  (1)  holds,  if  and  only  if  the  following  conditions  hold:  CA. 
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dE  =  I,  manifest  monotonicity,  and 


Cov  (X.,Xj1Xj,0)  <  0,  V  i,j 


(2) 


Moreover  in  many  practical  settings  it  appears  that  the  covariance  in  (2)  continues  to  be 
negative  if  one  omits  the  conditioning  on  0,  because  Xj  is  “nearly  sufficient”  for  0.  In 
the  Rasch  model  for  example,  (2)  is  guaranteed  to  be  negative  when  0  is  omitted.  A  small 
simulation  study  is  reported  in  Appendix  C,  illustrating  similar  behavior  in  other  logistic 
IRT  models.  Thus,  although  (2)  spoils  a  characterization  in  terms  of  the  manifest  structure 
P[X  j  =  alone,  it  appears  we  are  quite  close  in  practical  situations. 


2  Conditional  association  and  essential  independence 

2.1  Conditional  association 

Holland  and  Rosenbaum  have  sought  covariance  conditions,  or  equivalently  probability  in¬ 
equalities,  in  the  distribution  of  2C/  which  must  be  satisfied  if  any  di  =  I  model  applies. 
The  starting  place  for  their  investigations  may  be  taken  to  be  coordinate-wise  nondecreasing 
functions  f{y)  of  finite  subtests  Y_=  (Ti, . . . ,  Yj^)  taken  from  X-  Examples  include 

•  The  weighted  average  /(K)  =  J2i°  ^  —  ^5 

•  The  “all  or  nothing”  score  /(VI)  =  01°  V); 

•  The  “at  least  one”  score  f{Y_)  =  max{yj  :  /  =  1,...,  Jo); 

•  Item  scores  f{Y_)  = 

The  coordinatewise  nondecreasing  functions  are  exactly  those  scoring  methods  which  assign 
more  credit  as  examinees  get  more  answers  correct. 

Under  LI,  any  two  such  scoring  methods  will  be  positively  correlated  at  each  fixed  ability 
level  t)  =  0  (Rosenbaum,  1984):  for  all  finite  subtests  Y_  taken  from  X  and  all  coordinatewise 
nondecreasing  functions  f{y)  and  g{y), 

Covif{Y),g{Y)\Q  =  e)>0,  (3) 
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for  each  possible  d.  This  condition  can  be  converted  into  a  condition  on  the  latent  structure 
into  to  a  condition  on  the  manifest  structure: 

Theorem  2.1  (Rosenbaum,  1984;  Holland  and  Rosenbaum,  1986).  If  X_  satisfies  di  —  1, 
then  2L  Is  conditionally  associated  (CA):  For  every  pair  of  disjoint,  finite  subtests  Y_  and 
Z  in  X_,  every  pair  of  coordinatewise  nondecreasing  functions  f{Y_)  and  g{y_),  and  every 
function  h{Z_), 

Cov  {f(Y.),g(Yi)\h{Z_)  —  c)  >  0  V  c  £  range(/i).  (CA) 

Intuitively,  a  =  1  test  possesses  so  much  internal  coherence  (the  item  responses  are 
driven  monotonically  by  the  single  latent  variable  0)  that  all  reasonable  subtest  scores  must 
be  correlated,  in  any  subpopulation  of  examinees  selected  by  any  criterion  /i(r)  relating  to 
another  part  of  the  test. 

Our  statement  of  Theorem  2.1  is  an  easy  extension  of  Holland  and  Rosenbaum’s  result 
for  finite  length  tests  to  infinite  item  sequences.  Note  also  that  for  finitely  many  binary  (or 
indeed  discrete)  random  variables  . .  ,Zm-,  conditioning  on  a  scalar-valued  function 

h{Z)  is  equivalent  to  Holland  and  Rosenbaum’s  practice  of  conditioning  on  vector-valued 
4(Z).  Seminal  special  cases  of  (3)  and  CA  were  developed  by  Holland  (1981). 

CA  represents  a  wide  variety  of  probability  inequalities  which  can  be  tested  in  the  man¬ 
ifest  distribution  P{2Lj  =  ^]-  When  CA  fails  the  items  cannot  be  treated  as  having  a 
di  =  1  latent  representation.  Apolications  to  studying  the  internal  coherence  of  a  set  of 
items  may  be  found  in  Rosenbaum  (1984)  or  Holland  and  Rosenbaum  (1986).  Related  work 
appears  in  Holland  (1981),  Rosenbaum  (1985),  Rosenbaum  (1987),  and  Rosenbaum  (19SS). 
An  application  of  CA  to  2issessing  the  dimensionality  of  standardized  tests  for  the  National 
Assessment  of  Educational  Progress  is  described  by  Zwick  (1987). 

2.2  Essential  independence 

A  successful  approach  to  identifying  unidimensional  latent  structure  outside  the  strict  d^^  =  1 
framework  has  been  pursued  in  the  seminal  work  of  Stout  (1997,  1990),  and  extended  by 
Junker  (1988,  1991).  The  main  idea,  which  borrows  from  both  the  ‘‘large  sample  theory” 
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tradition  in  mathematical  statistics  and  the  “factor  analysis”  tradition  in  psychometrics,  is 
that  of  essential  independence^ . 

For  any  (infinite)  sequence  of  dichotomous  items  X  =  {Xi,  X2,  ^2,  ■  ■  ■),  define  bounded 
item  scores  to  be  functions  Aj{Xj)  such  that  for  some  M  <  00,  \Aj{Xj)\  <  M  for  all 
j.  We  will  call  a  bounded  item  score  an  ordered  item  score  if  moreover  ^j(O)  < 

Define  a  bounded  test  score  to  be  the  average  of  the  first  J  bounded  item  scores  Aj  = 
j  Finally,  we  will  say  the  ordered  item  scores  are  asymptotically  discriminating 

if  ~  ^j(O)}  is  positive  and  bounded  away  from  0  as  J  00. 

The  infinite  item  sequence  X  is  essentially  independent  (El)  with  respect  to  0  if  and  only 
if 


lim  Var(ylj|0  =  ^)  =  0  (El) 

J—*oo 

for  all  bounded  test  scores  Aj.  When  El  holds,  Aj  is  a  consistent  estimator  of  the  “true 
score”  Aj{9)  =  E[Aj\Q  =  0],  as  J  — +  00.  In  particular,  for  a  sequence  of  dichotomous  items 
A'.  El  implies  that  the  proportion  correct  score  Xj  consistently  estimates  values  of  the  test 
characteristic  function  Pj{d),  as  J  ^  00. 

The  item  sequence  A  is  essentially  unidimtnsicnal,  for  whici  we  shall  write  d£  =  1,  if 
and  only  if  (a)  A  is  El  with  respect  to  a  unidimensional  0;  and  (b)  the  items  are  locally 
asymptotically  discriminating,  LAD;  for  every  set  of  ordered,  asymptotically  discriminating 
item  scores,  the  “true  score”  Aj{6)  is  nondecreasing  in  6,  in  the  strong  sense  that  to  every 
9  there  corresponds  an  interval  Ng  containing  9  and  an  >  0  such  that 


Aj{t)  —  Aj{9) 
t  -  9 


>eg,yt£Ng,t:f^9,\f  J. 


(LAD) 


If  no  such  unidimensional  0  exists,  we  write  d£;  >  1. 

It  is  not  apparent  from  the  above  discussion,  but  Stout  (1987)  and  Junker  (1988,  Section 
3.2),  make  it  clear  that  d^  =  I  can  be  checked  from  the  marginal  distribution  of  A.  A 
statistical  procedure  for  testing  the  hypothesis  that  a  set  of  J  items  Aj  comes  from  a  df;  =  1 
item  sequence  has  been  developed  by  Stout  (1987)  and  refined  by  Stout  and  Nandakumar 
(Nandakumar,  1987,  1990;  Nandakumar  and  Stout,  1990). 

When  d£  =  1,  Aj{9)  may  be  inverted  to  produce  estimates  of  9  directly: 

^Actually,  we  use  Stout’s  strong  essential  independence,  with  some  minor  changes  in  terminology  to  match 
Junker  (1991).  Any  of  the  three  variations  of  El  could  be  used;  cf.  Corollary  2.1  in  Section  2.3. 
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Theorem  2.2  (Stout,  1990).  If  the  item  sequence  X_  satisfies  ds  =  1  with  respect  to  0  then 
for  any  set  of  asymptotically  discriminating  item  scores. 


V  c  >  0,  lim  P 

J— »O0 


=  =  1, 


where  Aj'(u)  is  the  inverse  function  for  the  “true  score”  Aj{6). 


(4) 


Indeed,  under  the  conditions  of  Theorem  2.2  and  some  mild  smoothness  conditions,  the 
maximum  likelihood  estimate  of  6  calculated  as  though  LI  were  true  is  also  consistent  for 
6  (Junker,  1991).  Moreover  the  latent  trait  with  respect  to  which  =  1  holds  is  unique, 
up  to  a  monotone  transformation  (Stout,  1990).  It  is  valuable  to  think  of  El  as  the  greatest 
possible  weakening  of  LI  under  which  Ll-based  trait  estimation/prediction  schemes  could 
be  expected  to  work.  In  this  sense,  the  study  of  El  is  the  study  of  robustness  of  ability 
estimators  to  variations  from  an  LI  latent  structure.  Clarke  and  Junker  (in  progress)  pursue 
this  matter  in  a  more  general  setting. 


2.3  Combining  CA  and  df  =  1 

The  following  lemma  tells  us  that  under  =  1,  for  any  finite  subtest  E  in  Xj,  we 
may  approximate  expected  values  of  the  form  £^[/((K)]0]  with  expected  values  of  the  form 
P[f{y.)\oiJ  S:  Xj  <  fij],  3is  J  oo.  We  assume  for  the  remainder  of  the  paper  that 

P[C\Q  =  6]  is  continuous  in  0  (5) 

whenever  C  is  an  event  depending  on  only  finitely  many  A''’s.  Condition  (5)  implies  that 
E[f{Y.)\Q  =  t]  is  continuous  in  t  for  any  function  /(E)  of  finitely  many  items. 


Lemma  2.1  Suppose  X  satisfies  El  and  LAD  with  respect  to  some  unidimensional  0,  and 
assume  (5).  If  f{Yf)  is  a  function  which  depends  on  only  finitely  many  (fixed)  items  Y_  — 
(Vi, . . . ,  Vjo)  from  X_,  then  for  every  set  of  bounded  asymptotically  discriminating  item  scores 
Aj{Xj)  and  for  each  0  there  exist  ej  —*  0  for  which 


jim  £[/(!;)  \Aj'(Aj)-e\ 

J-^OO  L 


<  (J 


£[/(>:)  I©  =  ^]- 
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proof.  We  give  the  proof  in  the  Ccise  that  0  is  continuous,  but  a  similar  argument  may  be 
given  for  discrete  numerical  0. 

For  any  event  C,  let  Ic  take  the  value  1  if  C  is  true  and  0  if  C  is  false,  and  let  £[/{¥_);  C] 
=  E[f{Y_)lc].  We  may  decompose  the  expectation  on  the  left  above  as 

£[/(Z)||Aj'(Sj)-«|  <e] 

£(.«»;  |0-«l<e|  Elf(y)-.\A;'(Aj)-e\<e\  P||0 -«!<£] 

=  'pl|0-e|<'«|  '■  £[/(!:):  |0-«l<£|  P[\A:'(Aj)-l)\<e] 

=  I(£)  .  11(6)  ■  m(£). 

Note  that  for  any  rate  e  =  ej  0,  I(ej)  — >  E[f{yf)\Q  =  6],  as  J  — >  oo,  using  the  continuity 
condition  (5)  and  the  integral  mean  value  theorem.  The  idea  now  is  to  choose  e  =  ej  — >  0  so 
that  II  — ►  1  and  III  — »  1  as  J  oo.  We  will  look  at  II  explicitly;  note  that  III  is  a  special 
case  of  II.  We  have 

E{fiY)U^e-ei<c)} 

one  can  apply  Theorem  2.2  to  show  that  the  numerator  on  the  right  tends  to  zero  for  each 
fixed  e>0asJ— >oo;a  simple  diagonalization  argument  now  yields  a  rate  ej  -+  0  for  which 
II  — ♦  1.  A  similar  argument  works  for  III,  and  a  further  diagonalization  completes  the  proof. 
□ 

We  can  use  Lemma  2.1  to  gain  information  about  the  latent  structure  of  an  item  sequence 
X  from  the  manifest  condition  CA.  Proposition  2.1  shows  that  CA  and  ds  =  1  together  give 
the  same  local  association  condition  (3)  as  di,  =  1  alone. 

Proposition  2.1  Suppose  the  item  sequence  2L  satisfies  CA  and  df;  =  1,  and  suppose  that 
(5)  holds.  Then  (3)  holds: 

Coy{f{YfigiY)\e  =  0)>O 

for  all  0,  all  coordinatewise  nondecreasing  f  and  g,  and  all  finite  tests  Y_  taken  from  X. 

Remarks.  By  modifying  the  proof  of  Lemma  2.1  we  could  also  conclude  conditional  asso¬ 
ciation  given  0  =  d,  i.e.,  if  Z  were  a  finite  test  from  X  disjoint  from  Y_  and  h{Z)  were  any 
function,  then 

Cov(/(i:),^(r)|/i(z),0  =  d)>o. 
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proof.  Let  Y_  C  2Go  an  arbitrary  finite  test,  for  fixed  Jo,  let  W  =  (A^o+i, -^^0+2)  •  •  ■) 
and,  for  this  proof,  let  PjiO)  =  Using  CA  and  a  sequence  tj  obtained  from 

Lemma  2.1, 

0  <  Cov  [/(i:),y(y:)||?j’(H7^)_<l|<ejj 
-*  Cov{f{Y),g{Y)\e  =  ») 

as  J  — >■  oo.  □ 

Let  us  digress  briefly  to  indicate  another  way  in  which  CA  and  Jg  =  1  interact  well. 
Two  alternative  definitions  of  El  have  been  proposed  by  Stout  (1990),  one  involving  the  full 
sequence  2L  but  taking  absolute  values  of  covariances, 

Jim  (  ^  )■'  |Cov(X.,  =  0,  (6) 

and  another  involving  “nonsparse  subtests”  which,  in  the  present  context,  is  equivalent  to 
considering  only  those  asymptotically  discriminating  item  scores  for  which  -j(0)  =  0  and 
A.j[\)  G  {0,1},  and  requiring 

Jim  (  2  )"'  EE  Cov(>l(A:,),/l(Xi)l«)  =  0.  (7) 

It  is  not  known  in  general  whether  these  three  definitions  are  equivalent.  However,  under 
CA  they  are: 

Corollary  2.1  If  CA  and  LAD  hold  for  X_,  then  all  three  definitions  o/EI  are  equivalent. 

proof.  Condition  (6)  implies  El  as  defined  in  Section  2,  which  in  turn  implies  (7),  since 
each  condition  is  a  special  case  of  the  preceding  one.  For  the  converse  directions,  observe 
that  if  LAD  holds  (for  the  restricted  Ccise  of  Aj{Xj)  €  {0, 1)  Vj),  then  by  Proposition  2.1. 
Cov  (Ai,  Xjl^)  >  0,  Vi,j.  In  this  case,  (6)  becomes  a  special  case  of  (7),  and  we  are  done.  □ 

Returning  to  our  main  development,  the  next  proposition  complements  Proposition  2.1  by 
characterizing  LI  in  terms  of  quantities  that  could  in  principle  be  approximated  by  manifest 
quantities  E[f{Y_)\aj  <  Xj  <  0j]  as  in  Lemma  2.1  under  El  and  LAD. 

Proposition  2.2  The  item  sequence  X_  satisfies  LI  with  respect  to  0  if  and  only  if  the 
following  two  conditions  hold: 
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For  all  d,  all  nondecreasing  f  and  g,  and  all  finite  tests  Y_  taken  from  X , 


Cov(/(y), 5(11)10  =  0)>O; 


(S) 


For  all  6,  i,  and  j, 


Cow{Xi,x,\e  =  e)  <0. 


(9) 


Remarks.  Note  that  (8)  says  that  X_  is  associated  (as  defined  by  Esary,  Proschan  and 
Walkup,  1967),  given  0  =  0,  for  all  6 — indeed  (8)  is  exactly  the  same  as  (3). 


proof.  That  LI  implies  (8)  follows  from  Esary,  Proschan  and  Walkup  (1967);  (9)  is  trivially 
satisfied  under  LI  with  Cov  (X,,  .Yj|0)  =  0.  For  a  proof  of  the  converse,  in  unconditional 
form,  see  Newman  and  Wright  (1981)  or  Joag-Dev  (1983).  □ 


3  Two  new  conditions 

Propositions  2.1  and  2.2  suggest  that  if  we  require  both  CA  and  d£  =  1  in  the  item  sequence 
X,  we  are  not  far  away  from  a  strictly  unidimensional  representation.  Indeed,  by  Proposi¬ 
tion  2.1,  we  know  that  CA  and  dg  =  1  will  guarantee  (8).  In  this  section  we  will  explore 
conditions  on  P[X  i  =  ij]  that  will  guarantee  (9)  also.  We  also  show  that  it  is  possible  to 
check  for  monotone  ICC’s — a  condition  not  explicitly  provided  for  by  CA  and  dg  =  1 — by 
examining  the  “empirical  ICC’s”  P[Xj  =  l|A'^j  —  Xj/J]. 

3.1  CA,  dE  =  1,  and  useful  unidimensional  models 

It  can  be  seen  from  Theorem  2.1  and  the  discussion  following  the  definition  of  essential 
independence  that  CA  and  El  are  both  necessary  conditions  for  the  sequence  of  items  A  to 
have  a  =  1  representation.  In  this  section,  we  consider  several  “thought  examples"  which 
suggest  to  what  extent  these  two  conditions  suffice  to  characterize  d^,  =  1.  We  restrict 
our  attention  to  useful  di  =  \  models,  as  defined  in  Section  1.  Because  LAD  formalizes 
principle  U2  and  Theorem  2.2  satisfies  principle  Ul,  any  d^  =  1  model  in  which  0  varies 
in  the  examinee  population  is  useful  in  the  sense  of  Definition  1.1. 


14 


Example  3.1  CA  is  not  a  guarantee  that  a  useful di,  =  1  model  exists.  Suppose  {Xi,X2,  ■  •  •) 
are  independent  (unconditionally).  Then  CA  follows  from  a  result  of  Esary,  Proschan  and 
Walkup  (1967).  Now,  any  O  that  we  can  estimate  using  principle  U1  will,  according  to 
the  0-1  Law  of  probability  theory  (e.g.  Ash,  1972,  p.  278),  fail  to  vary  from  examinee  to 
examinee.  This  violates  principle  U3;  hence  no  useful  latent  trait  model  for  this  CA  se¬ 
quence  exists.  (Of  course  this  is  no  surprise,  since  a  fully  independent  sequence  of  response 
variables — e.g.  coin  flips — should  intuitively  not  be  able  to  tell  us  anything  useful  about  a 
latent  trait  anyway!)  □ 

Example  3.2  d£;  =  1  is  not  a  guarantee  that  a  useful  di,  =  1  model  exists.  Stout  (1990, 
Example  2.3)  gives  a  model  for  a  sequence  of  “paragraph  comprehension”  questions  which 
is  a  useful  d^  =  1  model.  We  shall  show  that  no  useful  =  1  model  can  be  formulated 
for  this  sequence  of  paragraph  comprehension  items.  Indeed,  if  items  Xi  and  Xj  refer  to 
the  same  reading  passage  we  expect  Cov  (X,-,Xj|0  =  ^o)  ^  0  for  some  Oo-  Now  suppose,  by 
way  of  contradition,  that  there  exists  a  unidimensional  latent  trait  t  with  respect  to  which 
LI  and  LAD  hold;  in  particular  Cov(A^,Xj|T)  =  0.  Now  by  Stout’s  unique  trait  theorem 
(Theorem  3.3  of  Stout,  1990),  r  would  be  a  monotone — indeed  invertible — transformation 
of  0,  r  =  ^(0).  But,  taking  to  =  g(0o), 

0  =  Cov  (ATi,  A'j  |r  =  <o) 

=  Cov(A.,A,l5(0)  =  9(^o)) 

=  Cov(A„X,|0  =  0,) 

#  0. 

This  contradiction  shows  that  no  such  r  can  exist,  i.e.  no  useful  =  1  model  exists  for  the 
sequence  of  paragraph  comprehension  items.  □ 

Example  3.3  CA  and  dg  =  1  together  may  not  guarantee  that  a  useful  d^,  =  1  model 
exists.  Let  A  be  an  item  sequence  satisfying  df;  =  1  with  respect  to  s  me  unidimensional 
0.  We  may  imagine  the  0  in  this  dg  =  1  representation  cis  being  the  first  coordinate  of  the 
latent  trait  vector  0  =  (0,  02,  03, . . . ,  Qd)  needed  for  &  dc  =  d  representation: 

P[2(j  =  xj\Q  =  e]  =  I  P[Xj  =  x^i©  =  e]f{o\Q  =  e)dd 
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for  each  J,  where,  by  LI  with  respect  to  0  =  (0, 02, 03, . . . ,  0<i), 


P[Xj  =  ^|0  =  ^]  =  n  -  Pmf- 

J=1 


In  general,  although  ds  =  I  means  that 


Jim  (  ^  Cov(2.,Z,10  =  «)  =  0 


for  every  nonsparse  subsequence  Z_  from  the  individual  covariances 

Cov(Z.,Z,|0  =  6)  =  Cov(/^(0),Pj(0)|0  =  9) 

may  be  positive  or  negative,  depending  on  how  the  traits  02,  03, . . . ,  0^  interact  with  0. 
Now  suppose  X_  satisfies  CA  also.  Then  by  Proposition  2.1, 


Cov(X.,Xj|0  =  0)>O,Vz>i;  (10) 

in  fact  the  stronger  condition  (3)  holds.  Thus,  not  only  is  0  the  dominant  trait  for 
but  the  “minor  traits”  needed  for  LI  to  hold  are  concordant  with  0,  in  the  sense  that  they 
interact  with  0  so  as  to  keep  the  local  inter-item  covariances  nonnegative. 

Under  CA  and  =  \,  therefore,  there  is  enough  coherence  among  the  items  that 
covariances  between  items,  given  0  =  0,  are  nonnegative.  Indeed  it  is  quite  plausible  that 
under  these  conditions,  for  some  tests,  some  of  the  inequalities  in  (10)  will  be  strict  (despite 
its  plausibility  we  have  not  been  able  to  construct  an  example  in  which  this  may  rigorously 
be  shown).  But  if  any  of  the  inequalties  (10)  are  strict  for  a  dominant  latent  trait  0  with 
respect  to  which  df;  =  1,  then  there  cannot  exist  any  other  unidimensional  trait  r  with 
respect  to  which  a  useful  di  =  \  model  exists;  this  follows  by  the  same  argument  as  in 
Example  3.2.  □ 


3.2  CSN:  A  natural  negative  covariance  condition 

Thus  some  condition  in  addition  to  CA  and  ds  —  I  seems  to  be  needed  to  get  a  useful  di  =  \ 
model.  When  LI  holds,  we  know  that  (3)  holds  also:  Cov (/(y),^(T)|0  =  0)  >  0  for  all 
finite  subtests  Y_  and  nondecreasing  functions  /  and  g.  As  indicated  in  Example  3.3,  there 
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may  be  some  situations  in  which  El  and  CA  hold  but  the  implied  coherence  among  items  is 
so  tight  that  LI  cannot  also  hold.  What  is  needed  is  a  condition  that  loosens  this  coherence 
so  that,  despite  (3),  individual  item  pairs  have  zero  covariance. 

An  interesting  condition  derived  by  Joag-Dev  and  Proschan  (1982)  implies  that  when  LI 
with  respect  to  0  does  hold,  then 

Cov(X.,A:j1Xj,0)  <  0  (LCSN) 

for  all  i  <  j  <  J.  This  says  that,  under  LI,  the  items  are  “not  too  tightly  bound  together” 
even  though  (2)  holds;  each  Xi  and  Xj  are  sufficiently  free  of  one  another  among  examinees 
of  the  same  ability  that  when  Xi  increases  from  one  examinee  to  the  next,  X;  is  free  to 
decreaise  so  that  the  test  score  Xj  may  be  kept  constant.  The  abbreviation  LCSN  stands 
for  locally,  covariances  given  test  scores  are  negative. 

However,  LCSN  is  a  condition  on  the  latent,  not  the  manifest,  structure.  To  obtain  a 
natural  manifest  structure  analogue  to  LCSN  it  is  useful  to  consider  the  special  case  of  the 
locally  independent  Rasch  model.  Here  Xj  is  sufficient  for  0,  i.e.  (Xi,  . . . ,  Xj)  are 
independent  of  0  given  Xj.  One  consequence  of  this  is  that 

Cov(X.,X,|Xj)  =  Cov(X.-,X,|Xj,0),  (11) 

in  which  caise  LCSN  is  equivalent  to  the  manifest  condition 

Cov(X.,Xj|Xj)  <  0  (CSN) 

for  all  i  <  J  <  J.  Here  CSN  should  be  read  2is  covariances  given  test  scores  are  negative.  In 
practice  it  may  be  necessary  to  allow  (CSN)  to  be  violated  for  very  small  values  of  J,  e.g. 
J  <  10. 

Because  Xj  is  not  sufficient  for  0  outside  the  Rasch  model,  CSN  is  an  imperfect  substitute 
for  LCSN.  It  is  valuable  to  know  how  closely  related  LCSN  and  CSN  are  under  LI.  Suppose 
LI  and  hence  LCSN  holds.  To  examine  CSN  one  might  consider  the  decomposition 

Cov(X.,X,|Xj)  =  E  [Cov(X.,X,|Xj,0)|Xj]  +Cov  [e[X.|Xj,  0],  E[X,|Xj,  0]|  Xj]  . 

The  first  term  on  the  right  is  nonpositive,  by  LCSN.  The  second  term  may  be  negative  or 
positive,  but  should  be  small  since  under  =  1  the  (posterior)  distribution  of  0  given  A'j 
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should  have  very  low  variance,  as  J  grows.  Some  preliminary  work  of  B.  Clarke  and  J.  K. 
Ghosh  (Clarke,  priv.  comm.)  points  toward  a  proof  of  this  assertion.  This  suggests  that  in 
most  di  =  I  models 


Cov(x.-,x,|Xj)«f: 


Cov(X.,X,|Xj,0) 


(12) 


for  longer  tests,  even  though  Xj  is  not  sufficient  for  0.  In  particular  when  Cov  (A',,  Xj|A^j) 
fails  to  be  negative  for  a  LI  model,  we  at  letist  expect  it  to  be  near  zero.  Thus  one  might 
consider,  not  CSN,  but  rather  a  condition  like 


Cov(Ai,X,lAj)  <  0  +  cj 


(13) 


for  suitably  chosen  tj  >  0;  this  is  similar  to  including  an  indifference  region  in  a  test  of 
the  null  hypothesis  CSN  against  a  general  alternative.  The  plausibility  of  CSN  and  (13)  in 
locally  independent  two  parameter  and  three  parameter  logistic  IRT  models  is  illustrated  in 
a  small  simulation  in  Appendix  C. 


3.3  MM:  Manifest  monotonicity 

When  one  desires  to  check  M  in  practical  situations,  a  condition  like  the  following  is  often 
used.  Let  Xij  =  X j  ~  X,f  J,  we  will  say  manifest  monotonicity,  MM,  holds  if 

£'[A'',|A’,j]  is  nondecreasing  in  Xij  (MM) 

for  all  i  <  J  (and  all  J).  This  intuitively  appealing  monotonicity  check  is  intimately  related 
to  dc  =  I  latent  structure,  in  that 

(a)  LI,  M  =>  MM; 

(b)  El,  LAD,  MM  =►  M;  and  hence 

(c)  under  LI  and  LAD,  MM  M. 

Assertions  (a)  and  (b)  are  proved  in  Proposition  3.1,  and  (c)  is  an  immediate  corollary.  Mole- 
naar  (priv.  comm.)  has  independently  discovered  (a),  and  we  report  examples  of  Molenaar 
and  Snijder  in  Appendix  B  that  indicate  the  limitations  of  the  method  of  proof  for  (a). 
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Lemma  3.1  If  LI  and  M  hold  for  an  item  sequence  X_  and  latent  trait  0,  then  0  is  stochas¬ 
tically  increasing  in  S  =  Xij: 

W  a  <b\/ c:  P[e  >  c\S  =  a]<  P{e  >  c|5  =  6]  (14) 


whenever  the  conditional  probabilities  are  defined. 

proof.  We  may  apply  Grayson  (1988),  Theorem  2,  to  the  subtest  (Xi,  . . X,_i,  . . .. 

Xj)  to  see  that  the  score  S  =  Xu  has  the  monotone  likelihood  ratio  property 

pf  C  _  U\Q\ 

^ab{d)  =  -577; - nondecreasing  in  6,  ^  a  <  b,  (15) 

.1  [O 

whenever  the  conditional  probabilities  are  defined.  To  establish  (14)  (for  any  score  S  satis¬ 
fying  (15)),  we  may  write  its  left  hand  side  a^ 

r+00 

/  P[S  =  a\e]dF(9) 

=  a]  =  7+00'  ■- - - - 

/  p{s  =  a\e]dF{e) 

J —00 

=  p[r>c] 

where  T  is  a  random  variable  with  density  proportional  to  P[S  =  a\t]  ■  dF{t).  (I.e.,  T  = 
[0|5  =  a].)  On  the  other  hand,  the  right  hand  side  of  (14)  may  be  written  as 

/  p[s  =  b\e]dF(e) 

P[0  >  c|5  =  6]  =  ^ - 


P{d  >  c\S 


/+00 

P{S  =  b\0]dF{0) 

-00 

/•+00 

/  P[S  =  a\0]R,k{0)dF{0) 

^+00  ~ 

/  P[S  =  a\0]RMdF{0) 

J  —00 


_  £'[Ha6(r)l{r>c}] 

-  E\R,,{T)] 

for  the  same  random  variable  T  (where  Ic  is  the  function  that  takes  the  value  1  when  C  is 
true  and  0  otherwise).  Hence  (14)  is  equivalent  to  the  assertion  that 


P[T  >  c]  ■  E[R,t{T)]  <  E[Rab{T)l{T>c}] 

which  follows  from  property  (P3)  of  Esary,  Proschan  and  Walkup  (1967),  since  g{T)  =  1{t>c) 
and  h{T)  =  Rab[T)  are  both  nondecreasing  functions  of  T .  □ 
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Proposition  3.1 

(a)  LI,  M  =>  MM 

(b)  El,  LAD,  MM  and  (5)  M 

proof,  (a)  Note  that 

E[X,\Xu] 


E[E[Xi\Xu,Q] 

e:[p.(0)!x.j] 


a: 


ij 


(16) 


by  LI.  This  last  expectation  is  nondecreasing  in  Xu  by  M  and  Lemma  3.1,  using  a  result  of 
Lehmann  (1955). 

(b)  Let  then  there  exist  sequences  Qj  ^  <  I3^j^  and  Oj  ^  with  <  Qj  ' 

for  all  large  J,  such  that  <  Xu  <  l^j^}  =  ~  ^^’^1  <  from  Lemma  2.1. 

Then 

e;[;c.|0  =  =  limE:[X.la<;)<X.v</5y^] 

J— *00 

<  lim  E[X,\a^p  <  Xu  <  0^^] 

J-^OO 

=  £[X.|0  =  ^<2)] 


where  the  middle  inequality  follows  from  the  fact  that 


£[X,|aS''  <  Xu  <  -lift'- 

Pla'J'  <  Xu  <  ’ 

«(J)  _  _ 

Yl  ^  (,)E[A-.|X.y=c]P[X.^=c] 

=  , _ 

^  (I) 
c=o^ 

-  - 

X:  (,)E[X.|A'.^=c]P[X.j=c] 

<r  _ ,  , _ 


(i)i 


XI  ^  (2) 

(2)  ^  V. 


=  E[XMj<^u<0j 


?(2)l 


(under  MM,  the  second  ratio  of  sums  above  is  a  weighted  average  of  larger  conditional 
probabilities  than  the  first  one).  □ 


Corollary  3.1  Under  LI,  LAD  and  (5)  we  have 


MM  ^  M 
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4  Characterization  of  =  1 

The  previous  two  sections  may  be  summarized  in  the  following  theorem. 


Theorem  4.1  If  2L  is  an  item  seqence  satisfying  di  =  1  and  if  LAD  holds  with  respect  to 
the  latent  trait  0,  then  each  of  the  conditions  CA,  ds  =  1,  LCSN  and  MM  hold. 

We  show  in  this  section  that  the  converse  is  also  true:  the  four  conditions  CA,  =  1, 
LCSN  and  MM  guarantee  a  useful  di  =  I  representation;  hence  these  conditions  characterize 
useful  di  =■  \  representations.  Moreover  the  converse  implication  is  still  true  if  LCSN  is 
replaced  with  its  manifest  analogue  condition  CSN. 

The  main  problem  with  obtaining  these  two  converses  of  Theorem  4.1  is  that  our  negative 
covariance  criteria  CSN  and  LCSN  use  conditioning  on  fixed  values  of  Xj  only,  whereas 
Lemma  2.1  gives  approximations  to  (8)  and  (9)  which  require  conditioning  on  intervals 
otj  <  Xj  <  0j.  The  next  lemma  connects  these  two  forms  of  conditioning.  We  will  assume 
that  for  each  J  and  i  <  J  there  exist  differentiable  gij  such  that 


E[Xi\Xj]=gij{Xj)  I 

sup,,j,„  l5.v(«)i  <  M  <  oo  J 

and  that  for  each  J,  i  <  J,  and  0  there  exist  differentiable  gue  such  that 

E[X,\Xj,e  =  e\  =  gu0{Xj)  1 
sup.,j,„  <Me<oo  j 


(IT) 


(IS) 


The  conditions  (17)  and  (18)  are  maximum  discrimination  conditions  on  item-test  regres¬ 
sions;  most  likely  they  would  be  acceptable  in  practice.  In  particular,  note  that  (17)  and 
(18)  do  not  hide  a  monotonicity  assumption. 

Lemma  4.1  (a)  Suppose  CSN  holds,  and  suppose  (17)  also  holds.  Then  for  any  constants 

Qj  <  0j  for  which  the  covariances  are  defined,  and  for  which  3j  —  ctj  — >  0, 


limsupCov(Ai,  A''j|aj  <  Xj  <  0j)  <  0. 

j—*oo 


(19) 
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(b)  Suppose  LCSN  holds,  and  suppose  (18)  also  holds.  Then  for  any  constants  aj  <  Sj 
for  which  the  covariances  are  defined,  and  for  which  0j  —  aj  0, 

limsupCov(X.,X,|aj  <Xj<  l3j,9)  <  0.  (20) 

J— ►OO 

proof.  We  will  do  part  (a)  only;  part  (b)  is  virtually  identical.  We  have 

Cov{Xi,X,\aj<Xj<0j)  =  E[CoviXi,X,\Xj)\aj<Xj  <0j] 

+  Cov  [E[X,\Xj],  E[X,\Xj]\aj  <Xj<0j\. 

The  first  term  on  the  right  is  evidently  nonpositive  for  J  large.  Let  us  drop  the  conditioning 
on  q;j  <  Xj  <  j3j  from  the  notation  for  brevity;  then  the  second  term  in  (21)  is 

Coy[E[X,\Xj],E[Xj\Xj]]  =  Cov[gu(Xj),g,j{Xj)] 

<  {W^.Tg,J(XJ)■y^ivg,J(XJ)y^, 

by  the  Cauchy-Schwarz  inequality.  Now  applying  Taylor’s  theorem, 

Var^.j(Xj)  <  [max„lg-j(u)lP  •  VarXj 

so  that  conditioning  on  aj  <  Xj  <  fSj,  which  forces  Wa,T  Xj  — >  0  and  hence  Va.T gij{X j)  — +  0 
as  J  — ♦  oo,  also  forces  the  second  term  in  (21)  to  go  to  zero,  completing  the  proof.  □ 

Now  we  are  ready  to  state  and  prove  the  tw'o  converses  to  Theorem  4.1.  The  formal 
statement  of  our  characterization  of  useful  =  1  models  is 

Theorem  4.2  Suppose  X_  is  an  item  sequence  and  0  is  a  unidimensional  trait,  and  suppose 
(5),  (17)  and  (18)  hold.  Then 

(a)  CA,  dE  =  I,  LCSN,  MM  =  1,  LAD 

(b)  CA,  ds  =  1,  CSN,  MM  ^dL  =  l,  LAD 

Remarks.  In  the  implications  “=>”  in  (a)  and  (b),  0  is  the  trait  with  respect  to  which 
dg  =  I  holds,  and  the  theorem  asserts  that  in  fact  d^,  =  1  holds  with  respect  to  this  0.  In 
the  implication  “4=”  in  (a),  0  is  the  trait  with  respect  to  which  di  =  1  holds.  In  both  cases, 
0  is  unique  up  to  monotone  transformation  (Stout,  1990,  Theorem  3.3). 
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proof.  It  is  more  convenient  to  prove  (b)  first. 

Part  (b),  There  are  three  conditions  to  check  on  the  right:  LI,  M,  and  LAD. 

LAD  follows  from  df;  =  1  by  dehnition.  M  follows  from  MM  and  df;  =  1  via  Proposi¬ 
tion  3.1(b).  LI  follows  from  CA,  ds  =  1  and  LCSN,  using  Proposition  2.1,  Proposition  2.2, 
and  Lemma  4.1(a),  since  (19)  implies  that  under  CSN  and  d^  =  1  we  have  Cov(A^,  Xj  |0)  < 
0  for  all  t,  j  and  0. 

Part  (a),  Again  we  must  check  LI,  M  and  L.AD.  LAD  and  M  follow  as  before. 

LI  follows  again,  using  Proposition  2.1,  Proposition  2.2,  and  Lemma  4.1(b),  since  now  (20) 
implies  that  under  LCSN  Cov  (A"",,  Aj|d)  <  0  for  all  i,  j,  and  6  (a  conditional  [given  0  =  0] 
form  of  Lemma  2.1  is  needed  to  show  this,  but  this  is  straightforward). 

Part  (a),  This  is  Theorem  4.1,  but  we  state  the  proof  for  completeness.  We  must 

check  MM,  CA,  El,  LAD  and  LCSN.  MM  and  CA  follow  from  di,  =  1  by  Proposition  3.1(a) 
and  Theorem  2.1,  respectively.  El  follows  from  LI  trivially,  LAD  is  assumed  on  the  right, 
and  LCSN  follows  from  LI  via  a  conditional  form  of  Theorem  2.8  of  Joag-Dev  and  Proschan 
(1982).  □ 

Theorem  4.2(a)  gives  a  complete  characterization  of  d^,  =  1  representations  among 
“smooth”  representations  satisfying  the  mild  monotonicity  condition  LAD — this  is  essen¬ 
tially  the  cla^s  of  useful  di  =  1  representations,  assuming  that  the  distribution  of  0  is  not 
concentrated  at  one  point.  The  characterization  is  of  interest  because  the  conditions  MM, 
CA  and  El,  are  all  conditions  on  the  manifest  structure  P[Xj  =  of  A-  LCSN  is  not  itself 
a  “manifest”  condition,  but  it  appears  to  be  quite  close  to  its  natural  manifest  structure 
analogue  CSN  in  practice.  Theorem  4.2(b)  gives  reasonably  general  conditions  on  the  man¬ 
ifest  structure  of  A  which  are  sufficient  to  guarantee  a  “useful”  d^  =  1  representation.  Note 
that  overly  restrictive  assumptions,  such  as  detailed  knowledge  of  the  forms  of  the  ICC's  or 
of  the  distribution  of  0,  are  not  needed  in  this  approach. 

It  is  important  to  point  out  that  Theorem  4.2  is  possible  only  through  the  use  of  an 
infinite  item  sequence  A.  The  constraints  put  on  the  latent  structure  of  a  finite  set  of  items 
X  I  =  ( A^i , . . . ,  Xj)  by  the  distribution  of  2fj  are  not  strong  enough,  in  general,  to  guarantee 
a  particular  form  for  the  latent  structure.  The  principal  difference  between  the  finite  case 
X  f  and  the  infinite  case  A  is  that,  whereas  a  latent  trait  may  be  only  imperfectly  estimated 
using  it  can  be  known  with  complete  accuracy  using  .A  (see  Levine,  1985,  for  a  related 
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discussion  about  the  limits  of  our  ability  to  know  G  from  20  for  finite  J).  Thus  the  use 
of  (conceptually)  infinite  sequences  of  items  seems  absolutely  vital  to  clarify  model-building 
and  model-identification  issues  (see  also  Stout,  1990,  for  a  discussion  of  this  point). 

5  Discussion 

In  this  paper  we  have  combined  the  conditional  association  (CA)  approach  of  Holland  and 
Rosenbaum  and  the  essential  unidimensionality  [d£  =  1)  approach  of  Stout  to  produce  a 
nearly  complete  nonparametric  characterization  of  useful  di  =  I  representations  for  dichoto¬ 
mous  IRT  data.  The  three  principles  Ul,  U2  and  U3  for  a  useful  representation  require 
that  the  latent  trait  0  can  be  consistently  estimated  from  the  item  responses:  that  0  is 
monotonically  related  to  the  test’s  “true  score;”  and  that  0  is  not  constant  in  the  examinee 
population. 

In  Section  2  we  reviewed  the  CA  and  dg  =  1  conditions.  A  crucial  feature  of  our 
analysis  was  the  embedding  of  the  finite-length  test  ^  into  an  infinite  sequence  of  items 
X_  which  extends  the  features  of  the  observed  set  of  items  Xj.  This  embedding  is  needed 
for  two  reasons:  first,  several  authors  have  observed  that  estimation  of  traits  to  arbitrary 
precision — and  hence  identification  of  latent  structure — cannot  be  expected  to  work  unless 
the  test  length  is  allowed  to  grow  without  bound;  and  second,  the  embedding  is  needed  to 
discuss  Stout’s  notions  of  essential  unidimensionality.  Moreover  in  many  practical  settings, 
the  embedding  can  be  justified  as  a  continuation  of  the  usual  process  used  to  manufacture 
items  (c.f.  Stout,  1990).  Under  CA,  all  three  definitions  of  df;  =  1  proposed  by  Stout  are 
equivalent,  and  CA  and  ds  =  I  together  bring  us  close  to  a  =  1  representation. 

In  Section  3  we  developed  two  further  conditions  which  seem  to  be  needed  to  obtain  a 
characterization  of  strict  unidimensionality.  First,  the  negative  covariance  condition  CSN 
(Cov  (A,,  AjIA'j)  <  0)  is  a  natural  one  to  add  to  CA  and  d^  =  1  to  ensure  that  the  items 
are  locally  independent  with  respect  to  0.  CSN  is  guaranteed  to  be  true  in  the  Rasch 
model,  and  a  simulation  in  Appendix  C  shows  that  CSN  is  also  plausible  in  tw’o- parameter 
and  three-parameter  logistic  models.  Second,  monotonicity  of  the  empirical  ICC’s  TfAj  = 
l|A^j  —  Xj/J]  is  intimately  related  to  ICC  monotonicity:  this  “manifest  monotonicity”  must 
hold  if  di,  =  1  holds;  and  conversely  it  can  be  used  to  verify  monotonicity  of  the  usual  ICC's 


24 


P]{0)  when  dg  =  I  holds. 

In  Section  4  we  showed  that  useful  =  1  representations  may  be  characterized  by  the 
conditions  CA,  df;  =  1,  manifest  monotonicity,  and  a  local  version  of  CSN.  If  the  local  CSN 
condition  is  replaced  with  CSN  itself,  we  obtain  a  fairly  general  set  of  sufficient  conditions 
on  the  manifest  structure  P[X  ,  =  X  r]  as  J  oo,  for  a  useful  d£,  =  1  representation  to 
hold.  These  conditions  have  the  agreeable  property  that  parametric  forms  of  the  ICC's  are 
not  needed  to  check  d^,  =  1. 

It  is  important  to  note  that  some  form  of  monotonicity  or  ICC  smoothness  is  needed  to 
avoid  meaningless  models;  an  example  in  Appendix  A,  due  to  Suppes  and  Zanotti  (1981), 
illustrates  this.  Our  preferred  “nonparametric”  condition  has  been  Stout’s  local  asymptotic 
discrimination  LAD  condition.  In  parametric  settings,  a  general  monotonicity  condition  such 
as  LAD  might  be  dropped  in  the  face  of  other  smoothness  available  from  the  parametric  form 
of  the  model.  However  such  a  condition  is  often  plausible,  even  if  the  ICC’s  are  not  monotone, 
and  greatly  enhances  the  interpretability  oi  the  model. 

The  results  of  this  paper  bring  us  quite  close  to  a  characterization  of  useful  di  =  I 
representations  solely  in  terms  of  the  distribution  of  the  manifest  item  response  data.  This  is 
valuable  for  two  reasons.  First,  it  is  hoped  that  presenting  them  stimulates  further  discussion 
of  the  bcLsic  components  of  IRT  modeling.  For  example,  the  local  form  of  CSN  in  the 
characterization  above  suggests  that  both  a  positive  covariance  condition  like  CA,  and  a 
negative  covariance  condition  like  CSN  seem  needed  to  properly  understand  the  general 
d[^  =  1  assumptions.  Second,  such  a  characterization  suggests  practical  nonparametric 
tests  of  fit  for  the  general  d[^  =  1  representation.  But  the  practical  application  of  the 
characterization  theorem  requires  one  to  deal  with  a  potentially  difficult  multiple  inference 
problem:  testing  CSN  involves  combining  J  •  ^  ^  statistical  tests,  and  CA  involves  even 

more  statistical  tests.  Understanding  this  multiple-inference  problem  and  exploiting  possible 
dependencies  among  the  tests  to  reduce  the  problem  is  an  important  future  step  in  this 
research. 
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A  The  “necessity”  of  LI,  M  and  D 


The  assumptions  LI,  M  and  D  can  be — and  sometimes  are — weakened  but  not  dropped:  if 
any  one  of  them  is  completely  dropped,  the  resulting  version  of  (1)  can  be  made  to  fit  any 
distribution  of  dichotomous  items.  Two  of  the  examples  which  show  this  are  known  in  the 
literature,  but  we  repeat  them  here  for  completeness.  Example  A. 3  appears  to  be  new. 

Example  A.l  d  =  I  and  LI  hold,  but  M  dropped.  (Suppes  and  Zanotti,  1981).  Here 
we  allow  the  ICC’s  to  be  arbitrarily  rough  and  nonmonotone;  our  goal  is  to  represent  an 
arbitrary  distribution  P[X  t  =  ij]  as  in  (1).  Let  0  =  Yli  i.e.  0  is  the  base-two 

fraction  O.X1X2  . .  .Xj.  Then  the  ICC’s  may  be  written  as 

P[Xj  f|^]  f  {Int  (2.'-S)  mod  2=1 }  1 


where  “Int  (t)"  is  the  integer  part  of  t,  and  Ic  equals  1  when  C  is  true  and  0  when  C  is  false. 
The  likelihood  may  be  written  as  usual  as  under  LI,  and  the  distribution  of  0  is  described 
by  a  probability  mass  function 

0,  if^^. 

Finally,  (1)  becomes 


p|i;  =  xyi=  E 


This  example  can  be  modified  to  allow  infinitely  many  items,  items  with  more  than  two 
response  categories,  and  items  with  continuous  responses.  □ 


26 


Example  A. 2  LI  and  M  hold,  but  d  =  J .  (Holland  and  Rosenbaum,  1986;  Stout,  1990). 
To  write  the  representation  (1)  in  this  setting,  we  put  0  =  X j.  so  that  each  component 
6j  =  Xj  records  exactly  the  response  to  the  item.  The  ICC’s  may  be  written  as 

p[x,  =  iia  = 

which  are  certainly  coordinatewise  nondecreasing  in  the  multivariate  distribution  of  0  is 
exactly  the  same  cls  for  X  r. 

f{d)  =  P[Q  =  £]  =  P{Kj  =  £]• 


Here,  (1)  becomes 

=  2ji  =  E  n  mr’O  - 

e  j=i 

Example  A. 3  d  —  1  and  M  hold,  but  LI  dropped.  Here  we  set  0  =  H/  ^j-  The  ICC’s 


P[X,  =  1|0]  = 


1,  if  0  =  1; 
0,  if  0  =  0 


remain  monotone,  but  the  joint  likelihood  cannot  be  written  as  under  LI.  Instead, 


P[Kj  =  Xj], 
P[Kj  =  Xj], 
0, 


if  0  =  0  and  Xj  < 
if  0  =  1  and  Yli  X;  =  J', 
otherwise 


and  the  probability  mass  function  for  0  is 


f{0)  =  p[Y:ix,  =  jYPizix,  < 


Finally,  the  representation  (1)  is  simply  written  as 


e=o 

It  is  worth  remarking  that  each  of  these  examples  is  very  much  degenerate,  as  a  piece  of 
psychometric  modeling.  In  Example  A.l  and  Example  A. 2  the  latent  variables  chosen  are 
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not  latent  at  all,  in  that  they  are  completely  determined  by  the  examinee’s  responses  (and 
thus  represent  no  broader  sense  of  a  psychological  construct  than  the  examinee’s  responses 
themselves).  In  Example  A. 3  the  problem  is  the  “opposite”:  here  6  captures  virtually  nothing 
about  the  examinee’s  response  behavior.  The  examples  should  be  taken  as  indications  of 
how  quicky  the  modeling  process  can  devolve  if  the  assumptions  LI,  M  and  D  are  weakened 
too  much. 


B  Limits  on  generalizing  MM 


Molenaar  (priv.  comm.)  has  independently  discovered  Proposition  3.1(a),  and  reports  coun¬ 
terexamples  indicating  the  limitations  to  extensions  of  this  result.  Example  B.l  shows  that 
the  method  of  proof  of  Lemma  3.1  cannot  be  extended  to  the  polytomous  case.  Exam¬ 
ple  B.2  shows  that,  in  Proposition  3.1(a),  we  cannot  replace  the  “delete  average”  Xu  = 
7  Li  ~  ^i/J  with  the  more  natural  average  over  all  items  Xj. 


Example  B.l  (I.  Molenaar) .  The  monotone  likelihood  ratio  property  (15)  of  Grayson 
(1988)  does  not  extend  to  polytomous  items.  Let  0  <  0  <  1  and  consider  a  single  graded- 
response  item  X  taking  the  three  values  0,  1  or  2,  with 


P[X>l\e]  = 


P[X  >  2\6]  = 


I  30,  0  <  0  <  1/4 

I  I  +  i0,  1/4  <  0  <  1 
0,  0  <  0  <  1/4 

<  3(0 -i),  l/4<0<l/2 

,  i  -k  10,  1/2  <  0  <  1 


In  this  case,  for  0o  =  1/4  and  0i  =  1/2,  the  likelihood  ratio 


P[X  =  x\0o] 
P[X  =  a:10,] 

is  not  monotone  in  x.  For  one  may  calculate 


00  =  1/4 

01  =  1/2 

Ratio 

P[X  =  O|0] 

1/4 

1/8 

2 

II 

3/4 

1/56 

42 

II 

to 

0 

6/7 

0 

(22) 
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Here  the  ratio  (22)  is  not  monotone  in  x.  A  consequence  of  (15)  in  the  proof  of  Lemma  3.1 
would  be  that  the  likelihood  ratio  (22)  is  monotone  in  x;  since  it  is  not,  the  proof  of  Lemma  3.1 
cannot  be  extended  to  the  polytomous  case.  □ 


Example  B.2  (T.  Snijder).  P[Xj  =  IjA^j  =  s]  need  not  increase  with  s,  and  hence  we 
may  not  replace  Xij  with  Xj  in  Proposition  3.1(a).  Consider  three  dichotomous  item,  and 
a  two-point  distribution  for  0,  P(0  =  6q)  =  P(0  =  0i)  =  |.  Let 


P,(eo)  =  £,i  =  1,2,3; 
PM)  =  j; 

.^2(^1)  =  1  —  c;  and 
P3(^i)  =  1-e. 


It  follows  that,  as  c  — >  0, 


p[a:i  =  i|a^  =  i/3]  = 


(l-er-h^e  1 
3(1  -c)2-|-  1  -  ie  ^  4 


=  l|.Vj  =  2/31  = 


2  + 


Hence  for  small  e  >  0,  P[Ai  =  l\Xj  =  1/3]  >  P[Xi  =  l\Xj  =  2/3].  □ 


C  An  illustration  of  CSN 


We  have  argued  in  Section  3.2  that  the  CSN  condition 

Cov(A.,A0lXy)  <  0 


is  a  reasonable  one  to  look  for  in  dichotomous  test  data,  to  ensure  that  the  local  (given 
Q  =  6)  association  between  items  is  not  too  strong,  under  CA,  for  LI  to  hold. 

To  illustrate  the  plausibility  of  CSN  in  common  d^  =  I  IRT  models,  a  small  simulation 
study  was  performed.  Item  characteristic  curves  were  taken  to  be  of  the  logistic  form 


Pj{^)  =  Cj  +  (1  -  C;) 


_ 1 _ 

1  -I-  exp{-1.7aj(<?  -  6j)} 


(23) 
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All  three  of  the  popular  ICC  models  based  on  (23)  were  considered:  the  Rasch  model  (one 
parameter  logistic,  IPL)  in  which  Cj  =  0  and  aj  =  1;  the  two  parameter  logistic  model  (2PL) 
in  which  Cj  =  0;  and  the  three  parameter  logistic  (3PL)  model  in  which  all  parameters  are 
free.  Free  parameters  were  generated  as  appropriate  for  each  model  as  follows. 

•  Cj  =  0.5  +  0.5(j  mod  3)  +  aj,  where  aj  is  Af(0, 0.0625)  [normal,  mean  zero,  variance 
0.0625]  noise,  truncated  so  that  0.5  <  aj  <  1.5; 

•  bj  =  —2.0  +  4.035y  +  where  /?_,  is  A^(0,  jj)  noise; 

•  Cj  =  0.2  +  7j,  where  7^  is  A^(0,0.01)  noise,  truncated  so  that  0.0  <  Cj  <  0.4. 

For  each  of  the  three  models,  a  2000-examinee  test  administration  was  simulated,  for  varying 
test  lengths  J.  Examinee  abilities  were  sampled  from  a  A’^(0, 1)  distribution  and  responses 
were  generated  according  to  the  corresponding  locally  independent  IRT  model. 

The  Mantel-Haenszel  (M-H)  one-sided  z-test  for 

Ho:  CoviX„X,\Xj)  <  0  vs.  Hu  Cov (X^, XjjXj)  >  0 

combined  across  Xj  categories,  was  performed  for  each  pair  i,  j.  The  number  of  tests  for 
which  the  Mantel-Haenszel  z  exceeds  1.28  (corresponding  to  a  nominal  level  p  <  0.10)  is 
tallied,  and  these  particular  tests  are  displayed  in  detail. 

Note  that,  since  ^  2  )  statistical  tests  are  performed — one  for  each  pair  {i.j) — even  if 
Ho  is  true,  one  would  normally  expect  some  significantly  positive  covariances  because  of 
‘‘capitalization  on  chance.”  Hence  there  is  a  severe  multiple  inference  problem,  paralleling 
a  similar  problem  with  the  CA  condition.  Understanding  this  multiple-inference  problem 
and  exploiting  possible  dependencies  among  the  tests  to  reduce  the  problem  is  an  important 
future  step  in  this  research. 

The  results  for  the  Rasch  model  are  displayed  in  Table  1  and  Table  2.  Since  LCSN  and 
CSN  are  equivalent  under  the  Rasch  model.  Ho  is  always  true;  hence  the  nine  large  M-H  r's 
in  Table  1  are  exclusively  due  to  Type  I  errot  (capitalization  on  chance).  Hence  Table  1  may 
be  taken  as  a  baiseline:  if  in  another  model  the  number  of  large  M-H  c's  is  smaller  than  for 
the  Rasch  model,  we  may  be  confident  in  Ho,  and  hence  (13). 
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The  results  for  the  2PL  simulation  are  displayed  in  Tables  3  and  4,  and  for  the  3PL 
simulation  in  Tables  5  and  6.  Remarkably,  fewer  large  M-H  z’s  were  found  for  these  models — 
where  the  equivalence  of  LCSN  and  CSN  is  not  known  theoretically — only  two  large  M-H  z's 
were  found  for  the  2PL  simulation,  and  four  were  found  for  the  3PL  simulation.  Moreover, 
there  are  no  large  M-H  r’s  for  the  longer  tests,  suggesting  the  validity  of  (12)  as  J  — >  oo. 


J 

#  M-H  tests 

^  M-H  r’s  above  1.28 

10 

45 

0 

20 

190 

0 

40 

780 

9 

80 

3160 

0 

Table  1:  Number  of  large  positive  associations  for  2000-examinee  Rasch  simulation  ( J  =  test 
length). 


i 

J 

M-H  z 

Nominal  p 

^MH 

In(dAfH) 

32 

7 

1.403 

0.080 

1.634 

0.491 

36 

1 

1.779 

0.038 

3.172 

1.154 

36 

8 

1.434 

0.076 

1.640 

0.495 

37 

5 

1.497 

0.067 

2.142 

0.762 

37 

7 

2.401 

0.008 

6.083 

1.805 

37 

27 

1.347 

0.089 

1.295 

0.259 

38 

7 

1.502 

0.067 

2.539 

0.932 

39 

6 

1.408 

0.080 

1.196 

0.179 

40 

1.503 

0.066 

3.343 

1.207 

Table  2;  Details  for  40-item  Rasch  simulation  (qa///  =  est.  common  odds  ratio  across  X j 
categories). 
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J 

M-H  tests 

#  M-H  z's  above  1.28 

10 

45 

1 

20 

190 

1 

40 

780 

0 

80 

3160 

0 

Table  3:  Number  of  large  positive  associations  for  2000-examinee  2PL  simulation  ( J  =  test 
length). 


J 

i 

i 

M-H 

Nominal  p 

ln(Q!Af//) 

10 

1 

1 

1.683 

0.046 

2.981 

1.092 

20 

18 

4 

1.460 

0.072 

2.227 

0.801 

Table  4:  Details  for  10-  and  20-item  2PL  simulations  {oimh  =  est.  common  odds  ratio  across 
Xj  categories). 


J 

#  M-H  tests 

#  M-H  2’s  above  1.28 

10 

45 

0 

20 

190 

4 

40 

780 

0 

80 

3160 

0 

Table  5:  Number  of  large  positive  associations  for  2000-examinee  3PL  simulation  ( J  —  test 
length). 


i 

a 

M-H  2 

Nominal  p 

OMH 

In(dAfH) 

3 

1 

mgmm 

4 

1 

■EH 

1.654 

■i9 

4 

3 

1.777 

■EH 

1.300 

20 

1 

1.568 

1.411 

0.344 

Table  6:  Details  for  20-item  3PL  simulation  (oa///  =  est.  common  odds  ratio  across  X j 
categories). 
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